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Distance labeling schemes for trees 
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Abstract 

We consider distance labeling schemes for trees: given a tree with n nodes, label the nodes with 
binary strings such that, given the labels of any two nodes, one can determine, by looking only at 
the labels, the distance in the tree between the two nodes. 

A lower bound by Gavoille et. al. (J. Alg. 2004) and an upper bound by Peleg (J. Graph Theory 
2000) establish that labels must use 0(log 2 n) bits * 1 . Gavoille et. al. (ESA 2001) show that for very 
small approximate stretch, labels use 0 (log n log log n) bits. Several other papers investigate various 
variants such as, for example, small distances in trees (Alstrup et. al., SODA’03). 

We improve the known upper and lower bounds of exact distance labeling by showing that \ log 2 n 
bits are needed and that i log 2 n bits are sufficient. We also give (1 + e)-stretch labeling schemes 
using 0(logn) bits for constant e > 0. (1 + e)-stretch labeling schemes with polylogarithmic label 
size have previously been established for doubling dimension graphs by Talwar (STOC 2004). 

In addition, we present matching upper and lower bounds for distance labeling for caterpillars, 
showing that labels must have size 21ogn — 0(loglog?z). For simple paths with k nodes and edge 
weights in [1, n ], we show that labels must have size l 0 g n _|_ ©(l 0 g £). 
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1 Introduction 


A distance labeling scheme for a given family of graphs assigns labels to the nodes of each graph in the 
family such that, given the labels of two nodes in the graph and no other information, it is possible 
to determine the shortest distance between the two nodes. The labels are assumed to be composed of 
bits, and the goal is to make the worst-case label size as small as possible. Labeling schemes are also 
called implicit representation of graphs [60, 67]. The problem of finding implicit representations with 
small labels for specific families of graphs was introduced in the 1960s [14, 15], and efficient labeling 
schemes were introduced in [42, 53]. Distance labeling for general graphs has been considered since the 
1970/80s [38, 68], and later for various restricted classes of graphs and/or approximate distances, often 
tightly related to distance oracle and routing problems, see e.g. [6]. This paper focuses on distance 
labels for the well studied case of trees. 

Exact distances. In [57] Peleg presented an 0(log 2 n) bits distance labeling scheme for general un¬ 
weighted trees. In [37] Gavoille et al. proved that distance labels for unweighted binary trees re¬ 
quire | log 2 n — O(logn) bits and presented a scheme with l/(log3 — l)logn ~ 1.7log n bits. This 
paper presents a scheme of size | log 2 n + O(logn) and further reduces the gap by showing that 
| log 2 n — O(logn) bits are needed. Our upper bound is a somewhat straightforward application of 
a labeling scheme for nearest common ancestors [7, 8]. 

Approximate distances. Let dist^x, y) denote the shortest distance between nodes x,y in a tree 
T. An r-additive approximation scheme returns a value dist^(x, y), where distr(x,y) < disty(x,y) < 
dist T(x,y) + r. An s-stretched approximation scheme returns a value dist ' T (x,y), where disbr(x, y) < 
dist^(x,r/) < dist T{x,y) ■ s. For trees of height h Gavoille et al. [30, theorem 4] gave a 1-additive 
0(log n log h) bit labeling scheme. However, using an extra bit in the label for the node depth modulo 
2, it is easy to see that any 1-additive scheme can be made exact. Gavoille et al. [30] also gave upper 
and lower bounds of ©(loglog n log n) bits for (1 + 1/log n)-stretched distance. This paper presents a 
scheme of size 0(logn) for (1 + e)-stretch for constant e > 0. Labeling schemes for (1 + e)-stretch with 
polylogarithmic size label have previously been given for graphs of doubling dimension [61] and planar 
graphs [63]. 

Distances in caterpillars and paths. Labeling schemes for caterpillars have been studied for various 
queries, e.g., adjacency [13]. Here we present upper and lower bounds showing that distance labeling 
caterpillars requires 21ogn — ©(loglog n) bits. The upper bound is constructed by reduction to the case 
of weighted paths with k > 1 nodes and positive integer edge weights in [1, n], for which we give upper 
and lower bounds showing that labels must have size logn + ©(log k). 


Problem 

Lower bound 

Upper bound 

Exact, general trees 

I log 2 n 

5 log 2 n 

(1 + c)-stretch, general trees 

©(log n) 

Caterpillars 

2 log n — ©(log log n) 

Weighted paths, k nodes, weights in [1, n] 

ny 1 log n + ©(log k) 


Table 1: Results presented in this paper, e > 0 is a constant. 


1.1 Related work 

Distances in trees with small height. It is known that, for unweighted trees with bounded height 
h, labels must have size ©(log n log h). The upper bound follows from [30, Theorem 2] and the lower 
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bound from [37, Section 3] 2 . In [43] distance labeling for various restricted class of trees, including trees 
with bounded height, is considered, and in [62] another distance labeling scheme for unweighted trees 
using O(lognlogZi) bits is given. 

Small distances in trees. Distances in a tree between nodes at distance at most k can be computed 
with labels of size logn + 0(ky/logn) [44]. In [4] it is shown that size logn + ©(loglogn) are needed 
for labeling schemes supporting both parent and sibling queries. More generally, [4] shows that, using 
labels of size logn + O(loglogn), the distance between two nodes can be determined if it is at most k 
for some constant k, which is optimal for k > 1. In [31, 32] further improvements are given for small 
distances in trees. For k = 1, corresponding to adjacency testing, there is a sequence of papers that 
improve the second order term, recently ending with [5] which establishes that logn + 0(1) bits are 
sufficient. 

Various other cases for trees. Distance labeling schemes for various other cases have been consid¬ 
ered, e.g., for weighted trees [30, 37, 57], dynamic trees [50], and a labeling scheme variation with extra 
free lookup [48, 49]. 

Exact and approximate distances in graphs. Distance labeling schemes for general graphs [6, 
37, 38, 60, 66, 68] and various restricted graphs exist, e.g., for bounded tree-width, planar and bounded 
degree [37], distance-hereditary [34], bounded clique-width [20], some non-positively curved plane [17], 
interval [35] and permutation graphs [12]. Approximate distance labeling schemes, both additive and 
stretched, are also well studied; see e.g., [16, 24, 30, 33, 37, 39, 40, 51, 57, 65]. An overview of distance 
labeling schemes can be found in [6]. 

1.2 Second order terms are important 

Chung’s solution in [18] gives labels of size logn + O(loglogn) for adjacency labeling in trees, which 
was improved to log n + 0(log* n) in FOCS’02 [11] and in [13, 18, 27, 28, 45] to logn+ 0(1) for various 
special cases. Finally it was improved to logn + 0(1) for general trees in FOCS’15 [5]. 

A recent STOC’15 paper [9] improves label size for adjacency in general graphs from n/2+0(log n) [42, 
52] to n/2 + 0(1) almost matching an (n — l)/2 lower bound [42, 52], 

Likewise, the second order term for ancestor relationship is improved in a sequence of STOC/SODA 
papers [2, 3, 10, 28, 29] (and [1, 45]) to ©(loglogn), giving labels of size logn + ©(loglogn). 

Somewhat related, succinct data structures (see, e.g., [22, 25, 26, 54, 55]) focus on the space used 
in addition to the information theoretic lower bound, which is often a lower order term with respect to 
the overall space used. 

1.3 Labeling schemes in various settings and applications 

By using labeling schemes, it is possible to avoid costly access to large global tables, computing instead 
locally and distributed. Such properties are used, e.g., in XML search engines [2], network routing and 
distributed algorithms [21, 23, 64, 65], dynamic and parallel settings [19, 50], graph representations [42], 
and other applications [46, 47, 56, 57, 58]. Various computability requirements are sometimes imposed 
on labeling schemes [2, 42, 46]. This paper assumes the RAM model. 

2 Preliminaries 

Trees. Given nodes u,v in a rooted tree T, u is an ancestor of v and v is a descendant of u, if u is on 
the unique path from v to the root. For a node u of T, denote by T u be the subtree of T consisting of 

2 We thank Gavoille for pointing this out. 
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all the descendants of u (including itself). The depth of u is the number of edges on the unique simple 
path from u to the root of T. The nearest common ancestor (NCA) of two nodes is the unique common 
ancestor with largest depth. Let T[u,v\ denote the nodes on the simple path from u to v in T. The 
variants T(u,v] and T[u,v) denote the same path without the first and last node, respectively. The 
distance between u and v is the number dist(u,v) = \T(u,v]\. We set distroot(u) = dist(c,r), where r 
is the root of T. A caterpillar is a tree whose non-leaf nodes form a path, called the spine. 

Heavy-light decomposition. (From [59].) Let T be a rooted tree. The nodes of T are classified as 
either heavy or light as follows. The root r of T is light. For each non-leaf node v, pick one child w 
where |T U ,| is maximal among the children of v and classify it as heavy; classify the other children of v 
as light. The apex of a node v is the nearest light ancestor of v. By removing the edges between light 
nodes and their parents, T is divided into a collection of heavy paths. Any given node v has at most 
logn light ancestors (see [59]), so the path from the root to v goes through at most logn heavy paths. 

Bit strings. A bit string s is a member of the set {0,1}*. We denote the length of a bit string s by 
|s|, the ztli bit of s by s*, and the concatenation of two bit strings s, s' by s o s'. We say that sq is the 
most significant bit of s and S| s | is the least significant bit. 

Labeling schemes. An distance labeling scheme for trees of size n consists of an encoder e and a 
decoder d. Given a tree T, the encoder computes a mapping ex : V(T) —>• {0,1}* assigning a label to 
each node (T). The decoder is a mapping d : {0,1}* x {0,1}* —>• Z + , where Z + denotes the positive 

integers, such that, given any tree T and any pair of nodes u, v € V(T), d(e(u),e(v)) = dist(tt, v). Note 
that the decoder does not know T. The size of a labeling scheme is defined as the maximum label size 
|eT(it)l over a ll trees T and all nodes u € V(T). If, for all trees T, the mapping ex is injective we say 
that the labeling scheme assigns unique labels. 

3 Distances on weighted paths 

In this section we study the case of paths with k nodes and integral edge weights in [1, n]. The solution 
to this problem will later be used to establish the upper bound for caterpillars. 

3.1 Upper Bound 

Theorem 3.1. There exist a distance labeling scheme for paths with k nodes and positive integral edge 
weights in [1, n] with labels of size logn + 0(log k). 

Proof. We begin by considering the family of paths with k nodes, integral edge weights and diameter 
< n. We shall prove that there exists a distance labeling scheme for this family with labels of size 
log n + log k + O (log log k ). 

So consider such a path, and root it in one of its end nodes, denoted vq. Denote the nodes on the 
path 'Co, ..., Vk-i in order. Let d n = distroot(iq) and note that, by assumption, di < n for all i. We will 
let the label for Vi store the number di+x for some x < n that allows us to represent di + x compactly. 
Since we use the same x for all nodes, we can easily compute the distance between any pair of nodes 
Vi, Vj as |(di + x) — (dj + x)\. 

Since we choose x < n, the largest number stored in a label will be dk + x < 2 n, which can be 
represented with exactly L = [log(2n)~| bits. Divide those L bits to k + 1 segments, whereof k have 
£ = \_L/k\ bits and the last segment contains the remaining bits. The first segment, segment 0, will 
contain the £ least significant bits, segment 1 the following l bits and so on. We will choose x such that 
the representation of di + x has 0s in all the bits in the rtli segment. If we manage to do so, we will be 
able to encode each di + x with L — £ + [log k~\ bits. Indeed, we can use exactly [log k~\ bits to represent 


3 



i, and the next L — l bits to represent di + x where we skip the i’th segment. Prefixing with a string in 
the form of 1 °8f lo 8 fc lll, we get a string from which we can determine the number of bits needed to write 
[logfc] and therefrom the numbers i and di + x. We use this string as the label for u j . The label length 
is L — t + [log Af| + [log [log k]] + 1 = logn + log A + O(loglogA). 

It remains to show that there exist a number x < n as described. In the following we shall, as in 
the above, represent numbers < 2n with L bits that are divided into k + 1 segments whereof the first k 
have size l. For i < k and y < 2n, let a(i, y ) be a function which returns a number z with the following 
properties: 

(i) In z, all bits outside segment i are 0. 

(ii) z + y has only Os in segment i. 

This function is constructed as follows. If y only has Os in segment i, let a(i, y) = 0. Otherwise take the 
representation of y, zero out all bits outside segment i, reverse the bits in segment i and add v to the 
resulting number, where v has a 1 in the least significant bit of segment i and Os in all other positions. 

Note that from (i) it follows that adding z to any number will not change bits in less significant 
positions than segment i. We can now scan through the nodes vq ,... Vk-i, increasing x by adding bits 
to x in more and more significant positions (in non-overlapping segments), as follows: 

• Set x = 0. 

• For i = 1..., k — 1, set x = x + a{i, x + di). 


After iteration i we have that x + di in segment i only has Os, and in the following iterations, Is 
are only added to x in more significant bit positions, meaning that di + x continues to have only Os in 
segment i. Since the segments are non-overlapping, we end up with x < n. 

For the more general family of paths with k nodes and edge weights in [l,n], we simply note that 
the diameter of any path in this family is at most kn. Using the above result thus immediately gives 
us a labeling scheme with labels of size log n + 0(log k ). □ 

3.2 Lower bound 

Theorem 3.2. Labeling scheme for distances on weighted paths with k nodes and edge weights [1 ,n] 
require log n + 12 (log k) bits. 

Proof. Let J- denote the family of paths with k nodes and integral edge weights in [l,n]. We can 
construct all the members of T by selecting (k — 1) different edge weights in the range [l,n], skipping 
the paths which have already been constructed by the reverse sequence of edge weights. With this 
construction we will at most skip half of the paths, and hence |J-"| > \n k ~ l . Let the worst-case label 
size of an optimal distance labeling scheme for such paths have length L. The number of different 
labels with length at most L is N = 2 L+l — 1. We can uniquely represent each of the paths in T 
with the collection of their labels, and hence [J 7 ! < ( J j[). Thus, we have found that \n k ~ l < (^). Since 
(^) < ( Ne/k) k , it follows that ^d-logn < log N— log k+0(l) and hence that L > log?r+log k— 0(1) 
as desired. 

□ 

Combining Theorem 3.2 with Theorem 3.1 we see that distance labels for paths of k nodes with 
integral weights in [l,n] must have length ^dlogn + ©(log A:). 
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4 Distances in caterpillars 

4.1 Upper bound 

Theorem 4.1. There exist a distance labeling scheme for caterpillars with worst case label size 2 log re¬ 
tag tag re + O (tag tag tag re). 

Proof. We will start by giving a simple 2 tag re bits scheme and then improve it. The simple solution 
assigns two numbers to each node. The nodes on the spine save distroot and the number 0. The nodes 
not on the spine save their parent’s distroot and a number that is unique among their siblings. The 
second number is required to distinguish siblings, and hence determine if the distance between two 
nodes is 0 or 2. The worst-case label size for this solution is 2 tag re + 0(1). 

To improve the solution, we split up the nodes on the spine into two groups: (1) nodes with > ^ 
leaves and (2) nodes with < ^ leaves, for some parameter k to be chosen later. We add the root to the 
first group no matter what. Note that the first group can contain at most k nodes. 

As before, all nodes save two numbers: distroot and the number 0 for spine nodes or a number to 
distinguish siblings. The idea is to reduce label size with tag k bits by using fewer bits for the first 
number for nodes in the first group and for the second number for nodes in the second group. 

The nodes in the first group form a path with at most k nodes and edge weights in [l,re] (where 
each weight corresponds to the distance between the nodes in the original graph). The algorithm 
from Theorem 3.1 will add a number x, which is less than the diameter, which again is less than re, 
to the numbers representing the root distances of the nodes. Using this technique, we can, as seen 
in the proof of Theorem 3.1, encode the (modified) distroots of the nodes in the first group with only 
^4 tag re + tag k + 0 (tag tag k) bits. This gives labels of size 2k 7 x tag re + tag k + O (tag tag k) for non-spine 
nodes whose parents are in the first group. 

We will also add x to the distroots of nodes in the second group, but since x < re this will not 
change the label size by more than a single bit. For non-spine nodes whose parents are in the second 
group, we need at most tag re — tag A: + 0(1) bits for the second number, giving a total label size of 
2 tag re — tag k + 0(1). 

Finally, since the two numbers that form a label now have different lengths, we need an additional 
0(loglogA:) bits to determine when one number ends and the next begins. Indeed, it wil be possible 
to split up labels into their components if we know the number of bits used to write [tagA:], and we 
represent this number with 0(loglogA:) bits. 

Setting k = 2 iog S iogn ’ we now see that our worst-case label size is the maximum of 

2 tag re — tag k + 0(log tag k) = 2 tag re — tag tag re + 0(log tag tag re) 


and 


2k - 1 
k 


tag re + tag k + 0(log tag k) 


2 tag re — 2 tag tag re + tag tag re + 0(log tag tag re) 
2 tag re — tag tag re + O (tag tag tag re). 


This proves the theorem. □ 

4.2 Lower bound 

We present a technique that counts tuples of labels that are known to be distinct and compares the 
result to the number of tuples one can obtain with labels of size L. The technique may have applications 
to distance labeling for other families of graphs. 

Theorem 4.2. For any re > 4, any distance labeling scheme for the family of caterpillars with at most 
re nodes has a worst-case label size of at least 2|_logreJ — [log [log ?zj J — 4. 
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Proof. Set k = [lograj and m = 2 k . Let (*i, be a sequence of k numbers from the set {1,..., m/2} 

with the only requirement being that i\ = 1. Now consider, for each such sequence, the caterpillar whose 
main path has length m/2 and where, for t = 1 ,..., k, the node in position i t has \m/2k\ leaf children 
(not on the main path). We shall refer to these children as the t’th group. Note that two disjoint groups 
of children may be children of the same node if q = h for some s,t. Each of these caterpillar has 
m/2 + k[m/2k\ < m < n nodes. 

Suppose that a is a distance labeling scheme for the family of caterpillars, and consider one of 
the caterpillars defined above. Given distinct nodes u, v not on the main path, their distance will be 
dist(ra,u) = |i s — it | + 2, where i s and i t are the positions on the main path of the parents of u and 
v , respectively. In particular, if s = 1, so that i s = 1, then dist(ra,u) = it + 1. Thus, if a has been 
used to label the nodes of the caterpillar, the number i t for a child in the t’th group can be uniquely 
determined from its label together with the label of any of the children from the first group. It follows 
that any /c-tuple of labels (h ,..., Ik) where l t is a label of a child in the t’th group uniquely determines 
the sequence (ii,... ,ik )• In particular, /c-tuples of labels from distinct caterpillars must be distinct. Of 
course, /c-tuples of labels from the same caterpillar must also be distinct, since labels are unique in a 
distance labeling scheme. 

Now, there are {m/2) k ~ l choices for the sequence (*i,..., ik), and hence there are (m/2) fe_1 different 
caterpillars in this form. For each of these, there are \m/2k\ k different choices of /c-tuples of labels. 
Altogether, we therefore have {m/2) k ~ l \m/2k\ k distinct fc-tuples of labels. If the worst-case label 
size of a is L, then we can create at most ( 2 L+1 — l) k distinct /c-tuples of labels, so we must have 
{m/2) k ~ 1 [m/2k\ k < (2 i+1 — l) fc . From this it follow that 


k _ i 

L > I—-—(logm. — 1) + logl m/2/cJJ 

k 

> 7 —— + k - log k\ - 2 
k 

> 2k — [log k\ — 4 

= 2 [log raj — [log [log raj J — 4. □ 

5 Exact distances in trees 

5.1 Upper bound 

Let u , v be nodes in a tree T and let w be their nearest common ancestor. We then have 

dist (u,v) = distroot(ra) — distroot(u) + 2 dist(u;, u) (1) 

If w = u so that u is an ancestor of v, then the above equation is just a difference of distroots, which 
can be stored for each node with logra bits. The same observation clearly holds if w = v. 

Assume now that w ^ { u , u} so that u and v are not ancestors of each other. Consider the heavy- 
light decomposition [59] described in the preliniaries. At least one of the nodes u and v must have an 
ancestor which is a light child of w. Assume that it is v. Now, v has at most logra light ancestors. 
Saving the distance to all of them together with distroot gives us sufficient information to compute the 
distance between u and v using equation (1). This is the idea behind Theorem 5.2 below. 

By examining the NCA labeling scheme from [7, 8], we see that it can easily be extended as follows. 

Lemma 5.1 ([7, 8]). There exists an NCA labeling scheme of size O(logra). For any two nodes u,v the 
scheme returns the label of w = nca(ra, v ) as well as: 

• which of u and v (if any) have a light ancestor that is a child of w; and 
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• the number of light nodes on the path from the root to w and from w to u and v, respectively. 

Theorem 5.2. There exists a distance labeling scheme for trees with worst-case label size ^ log 2 n+ 
O(logn). 

Proof. We use O(logn) bits for the extended NCA labeling in Lemma 5.1 and for distroot. Using (1) 
it now only remains to efficiently represent, for each node, the distance to all its light ancestors. We 
consider the light ancestors of a node v encountered on the path from the root to v. The distance from 
v to the root is at most n — 1 and can therefore be encoded with exactly [logn] bits (by adding leading 
zeros if needed). By construction of the heavy-light decomposition, the next light node on the path to 
v will be the root of a subtree of size at most n/2, meaning that the distance from v to that ancestor 
is at most n/2 — 1 and can be encoded with exactly [logn] — 1 bits. Continuing this way, we encode 
the i’th light ancestor on the path from the root to v with exactly [logn] — i bits. When we run out of 
light ancestors, we concatenate all the encoded distances, resulting in a string of length at most 

1 o 1 

riogn] + (|"log n] — 1) H-f 2 + 1 = - [logn] + - [logn] . 

We can use O(logn) extra bits to encode n and to separate all sublabels from each other. The decoder 
can now determine [logn] and split up the entries in the list of distances. When applying formula (1), 
it can then determine the distance between v and w by adding together the relevant distances in the 
list of light ancestors, using the fact from Lemma 5.1 that it knows the number of light ancestors from 
v to w. □ 

5.2 Lower bound 

In the case of general trees, Gavoille et al [37] establish a lower bound of | log 2 n — O(logn) using 
an ingenious technique where they apply a distance labeling scheme to a special class of trees called 
(h,M)- trees 3 . The following uses a generalization of (h,M)- trees to improve their ideas and leads to a 
lower bound of \ log 2 n — 0( log n). 

( h , IF, a)-trees. We begin with some definitions. For integers h, W > 0 and a number a > 1 such that 
W/a l is integral for all i = (),... ,h, an (h,W, a)-tree is a rooted binary tree T with edge weights in 
[0, IF] that is constructed recursively as follows. For h = 0, T is just a single node. For h = 1, T is 
a claw (i.e. a star with three edges) with edge weights x, x, IF — x for some 0 < x < IF rooted at the 
leaf node of the edge with weight IF — x. For h > 1, T consists of an (1, IF, a)-tree whose two leaves 
are the roots of two ( h — 1, IF/a, a)-trees 7o,Ti. We shall denote an (h, IF, a)-tree constructed in this 
way by T = (T 0 ,Ti,x) An example for h = 3 can be seen in Figure 1. Note that the case a = 1 simply 
corresponds to the (h, IF)-trees defined in [37]. 

It is easy to see that an (/r, IF, a)-tree has 2 h leaves and 3 • 2 h — 2 nodes. Further, it is straightforward 
to see that, if u,v are leaves in an (h, IF,a)-tree T = ( Tq,Ti,x ), then 

f 2IF ~-i h + 2.x , if u G To and v € Tf, or vice versa, 
dist t{u,v) = < 1 “ (2) 

I dist^ (u, v ), if u, v € 7) for some i = 0,1. 

Leaf distance labeling schemes. In the following we shall consider leaf distance labeling schemes 
for the family of (h, IF, a)-trees: that is, distance labeling schemes where only the leaves in a tree need to 
be labeled, and where only leaf labels can be given as input to the decoder. Since an ordinary distance 
labeling scheme obviously can be used only for leaves, any lower bound on worst-case label sizes for a 
leaf distance labeling scheme is also a lower bound for an ordinary distance labeling scheme. We denote 
by g(h , IF, a) the smallest number of labels needed by an optimal leaf distance labeling scheme to label 
all ( h , IF, a)-trees. 

3 Note that their exposition has some minor errors as pointed out (and corrected) in [41] 
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W-x 



Figure 1: An (h, W, a)- tree, where h = 3. We require that x < W, yi,y 2 < W/a and z \,..., z 4 < W/a 2 . 
Lemma 5.3. For all h > 1 and W >2, g(h, W, a) 2 > Wg(h — 1, W 2 /a 2 ,a 2 ). 

Proof. Fix an optimal leaf distance labeling scheme a which produces exactly g(h , W, a ) distinct labels 
for the family of (h, W, a)-trees. For leaves u and v in an (h,W,a)~ tree, denote by l{u) and l(v), 
respectively, the labels assigned by a. For x = 0,..., W — 1, let S(x) be the set consisting of pairs of 
labels (l(u), l(v)) for all leaves u € To and v € T\ in all ( h , W. u)-trees T = (To, T\,x). 

The sets S(x) and S(x') are disjoint for x / x 1 , since every pair of labels in S(x) uniquely determines 
x due to (2). Letting S = U'Lg 1 S'(x), we therefore have |Sj = |'5(^)|- Since S contains pairs 

of labels produced by a from leaves in (h,W,a )-trees , we clearly also have |Sj < g(h,W, a ) 2 , and 
hence it only remains to prove that |Sj > Wg(h — 1, W 2 /a 2 , a 2 ), which we shall do by showing that 
|5(a;)| > g(h — 1, W 2 /a 2 , a 2 ) for all x. 

The goal for the rest of the proof is therefore to create a leaf distance labeling scheme for (h — 
1, W 2 /a 2 ,a 2 )-trees using only labels from the set S(x) for some fixed x. So let x be given and consider 
an (h — 1, W 2 /a 2 , a 2 )-tree T'. Let V = W/a. From T' we shall construct an (.h — 1,V, a)-tree <t>i(T') 
for i = 0,1 such that every leaf node v in T' corresponds to nodes <f>i(v) in <f>i(T') for i = 0,1. The 

trees <j>i(T') are defined as follows. If h = 1, so that T' consists of a single node, then 4>i(T r ) = T' 

for i = 0,1. If h > 1, then T’ is in the form T 1 = (T/. T [, y) for some 0 < y < V 2 . We can write 

y in the form y = yo + y\V for uniquely determined yo,2/i with 0 < yo,yi < V. For i = 0,1, we 

recursively define < pi(T') = (0j(Tq), j/j). Thus, < pi(T') is an (h — 1, V, a)-tree that is similar to T' 

but where we replace the top edge weight y by edge weights yt and, recursively, do the same for all 
(h — 2, V 2 /a 2 , a 2 )-subtrees. Note also that the corresponding edge weight V 2 — y in T' automatically is 
replaced by the edge weight V — yi in (j)i(T') in order for (j)i(T') to be an (h — 1, V, a)- tree. 

Denote by <fi(v) the leaf in <j>i(T') corresponding to the leaf v in T'. 

Consider now the (h, W, a)-tree T = (^(T'), ^i(T'), x). Every leaf v in T' corresponds to the leaves 
cj>o(v), <fi(y) in T where <fi{v) € 4>i{T ') for i = 0,1. Using formula (2) for the distances in T', it is 
straightforward to see that 

dist T '(u,v) = (dist^ o(T q(0 o (M),0o(^)) mod (2U)) + Udist^ l(T /)(^i(u), <f>i(v)). 

We can now apply the leaf distance labeling scheme a to T and obtain a label for each leaf node in 
T. In particular, the pair of leaves (<j>o(v), 4>i(v)) corresponding to a node v in T 1 will receive a pair of 
labels. We use this pair to label v in T ’, whereby we have obtained a labeling of the leaves in T 1 with 
labels from S(x). Using the formula in (5.2) we can construct a decoder that can compute the distance 
between two nodes in T' using these labels alone, and hence we have obtained a leaf distance labeling 
scheme for (h — 1, V 2 , a 2 )-trees using only labels from S(x ) as desired. □ 
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Lemma 5.4. For all h > 1 and W >2, g(h,W,a) > ■ 

Proof. The proof is by induction on h. For h = 1 we note that an (0, IT, a)- tree has only one node, 
so that g(0, W 2 /a 2 , a 2 ) = 1. Lemma 5.3 therefore yields g(l, IT, a) 2 > IT from which it follows that 
g(l,W,a) > \/W. The lemma therefore holds for h = 1. Now let h > 1 and assume that the lemma 
holds for h — 1. Lemma 5.3 and the induction hypothesis now yield 

g(h, IT, a f > Wg(h - 1, IT 2 /a 2 , a 2 ) 

> w iW 2 /a 2 ) (fe - 1)/2 

a 2(h-l)(h-2)/4 

- Wk 
Qh(h— 1)/2 

from which it follows that g(h, W, a) > J^h-\)iA as desired. □ 

The previous lemma implies that any (leaf and hence also ordinary) distance labeling scheme for 
(h, IT, a) -trees must have labels with worst-case length at least | (log IT — -^-loga) = \h\ogW — 
\h 2 \oga + \hloga. Since the number of nodes in such a tree is n = 3 • 2 h — 2, it follows that h = 
log(n + 2) — log 3, and hence that logn — 2 < h < log n for sufficiently large n. From this we see that 
the worst case label length is at least 

- log n log W — - log n(log n — 1) log a — log W — — log a. 

In the case where a = 1, we retrieve the bound of \ logn log IT — log IT obtained in [36]. It seems that 
larger values of a only makes the above result weaker, but the the real strength of the above becomes 
apparent when we switch to the unweighted version of ( h , IT, a)-trees, in which we replace weighted 
edges by paths of similar lenghts. Note that a distance labeling scheme for the family of unweighted 
( h , IT, a)-trees can be used as a distance labeling scheme for the weighted ( h , IT, a)-trees, and hence any 
lower bound in the weighted version automatcially becomes a lower bound in the unweighted version. 
The number of nodes n in an unweighted ( h , IT, a)-tree is upper bounded by 

n < 2W + 2 • 2W/a + 2 2 • 2IT/a 2 + • • • + 2' 1 " 1 • 2W/a h ~ 1 + 1 

In the case a = 2, we get n < 2Wh + 1. 

Theorem 5.5. Any distance labeling scheme for unweighted (h, IT, 2) -trees, and hence also for general 
trees, has a worst-case label size of at least \ log 2 n — O(logn). 

Proof. Choose the largest integer h with 2 • 2 h h + 1 < n, and note that we must have h > logn — 
O(loglogn). Set IT = 2 h and consider the family of (h, W, 2)-trees, which is a subfamily of the family 
of trees with n nodes. From Lemma 5.4 it therefore follows that the worst-case label length is 

log IT — hr 2 + = jh 2 + \ h = j log 2 n + j logn - 0(log logn). □ 

6 Approximate distances in trees 

In this section we present a (1 + e)-stretch distance labeling schemes with labels of size O(logn). 
Theorem 6.1. For constant e > 0, 1 + e stretch labeling scheme use ©(logn) bits. 
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Proof. As in the case of exact distances, we will create labels of size O(logn) bits that contain the 
extended NCA labels from Lemma 5.1 as well as distroot. We will also be using the formula in (1). 
However we can not afford to store exact distance to each apex ancestor. Even storing an 2-approximate 
distance to each apex ancestor would require log n log log n bits. Furthermore, given approximate dis¬ 
tance to the apex nodes does not directly guarantee upper bound for the approximate distance, as we 
in equation (1) are using subtractions. We will in the following address these two problems. 

Let w = nca (u,v) and assume w 0 {u,u}, since otherwise we can compute the exact distance using 
only distroot. Suppose we know a (1 + ^-approximation a of dist(rc,u) for some e > 0. That is, 

dist(io,v) < a < (1 + e) dist(u/,u). (3) 

Define d = distroot (u) — distroot(u) + 2a. First we show that d is a (1 + 2e)-approximation of dist(u, v). 
Next we show how to represent all the (1 + ^-approximate distances to light ancestors for a node using 
a total of O(logn) bits. Together with formula (1), these two facts prove that we can compute (1 + 2e)- 
stretch distances between any pair of nodes with labels of size O(logn). To prove the theorem, we can 
then simple replace e by \e. 

To see that d is a (1 + 2e)-approximation of dist(u,u), first note that 

d = distroot(u) — distroot(u) + 2 a> distroot( - u) — distroot (u) + 2dist(tc,u) = dist (u,v). 

For the other inequality, note that 

d = distroot («) — distroot (v) + 2a 

< distroot (it) — distroot(u) + 2(1 + e) dist(rc, v) 

= distroot(n) — (distroot(u) — dist(tu, v)) + (1 + 2e) dist(tc, v) 

= distroot (it) — distroot(rc) + (1 + 2e) dist(u>, v) 

= dist(u, w) + (1 + 2e) dist(u>, v) 

< (1 + 2e) (dist(u, w) + dist(ru, v)) 

= (1 + 2e) dist(xi, v) . 

It now only remains to show that we can compactly store all the approximate distances a to light 
ancestors using O(logra) bits space. 

We use a heavy light path decomposition of the tree. For each node v we can save a 2 approximate 
distance to all its k proper light ancestors node as follows. Let S be a binary string initially with k 
zeros. Before each 0 we now inserts Is such that, if we have j Is in total from the beginning of S and 
to the rth 0, then the distance to the ith light ancestor a of v satisfies that 2^~ l < dist(u, a) < 2A This 
is the same as traversing the tree bottom-up from v and, for each light node encountered on the way, 
adding a 0 and each time the distance doubles adding a 1. The number of Os equal the number of light 
nodes which is at most logn, and the number of Is is also limited by logn since n is the maximum 
distance in the tree. In total the length of S is at most 2logn. 

Using the O(logn) bits label from Lemma 5.1 we can tell if one node is an ancestor of another, and 
if not which one has a light ancestor a that is a child of their nearest common ancestor w. In addition, 
we can determine the total number i of light ancestors up to a. This means that we can compute j. 
and hence the 2-approximation j — 1, as the number of l’s in S until the i’th 0. 

We have now obtained a 2-approximation with labels of size O(logn). We can improve this to a 
(1 + ^-approximation by setting a 1 in S each time the distance increases with 1 + e rather than 2. 
This will increase the label size with a constant factor log ^ +e .) ■ 

This proves that there is a (1 + e)-stretch distance labeling scheme with 0(log?r). To complete the 
proof of the theorem, we note that, given any (1 + e)-stretch distance scheme, we can always distinguish 
nodes (since identical nodes have distance 0), which means that we always need at least n different 
labels, and hence labels of size at least logn bits. □ 
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